Spam email categorization with nlp and using federated deep learning

Ul Haq, Ikram; Black, Paul; Gondal, Iqbal; Kamruzzaman, Joarder; Watters, Paul; Kayes, A.

Title: Spam email categorization with nlp and using federated deep learning
Creator: Ul Haq, Ikram; Black, Paul; Gondal, Iqbal; Kamruzzaman, Joarder; Watters, Paul; Kayes, A.
Date: 2022
Type: Text; Conference paper
Identifier: http://researchonline.federation.edu.au/vital/access/HandleResolver/1959.17/190671
Identifier: vital:17690
Identifier: https://doi.org/10.1007/978-3-031-22137-8_2
Identifier: ISBN:0302-9743 (ISSN); 9783031221361 (ISBN)
Abstract: Emails are the most popular and efficient communication method that makes them vulnerable to misuse. Federated learning (FL) provides a decentralized machine learning (ML) model, where a central server coordinates clients that collaboratively train a shared ML model. This paper proposes Federated Phishing Filtering (FPF) technique based on federated learning, natural language processing, and deep learning. FL for intelligent algorithms fuses trained models of ML algorithms from multiple sites for collective learning. This approach improves ML performance by utilizing large collective training data sets across the corporate client base, resulting in higher phishing email detection accuracy. FPF techniques preserve email privacy using local feature extraction on client email servers. Thus, the contents of emails do not need to be transmitted across the network or stored on third-party servers. We have applied FL and Natural Language Processing (NLP) for email phishing detection. This technique provides four training modes that perform FL without sharing email content. Our research categorizes emails as benign, spam, and phishing. Empirical evaluations with publicly available datasets show that accuracy is improved by the use of our Federated Deep Learning model. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Publisher: Springer Science and Business Media Deutschland GmbH
Relation: 18th International Conference on Advanced Data Mining and Applications, ADMA 2022, Brisbane, Australia, 28-30 November 2022, Advanced Data Mining and Applications, 18th International Conference, ADMA 2022 Vol. 13726 LNAI, p. 15-27
Rights: All metadata describing materials held in, or linked to, the repository is freely available under a CC0 licence
Subject: Deep learning; Federated learning; Incremental learning; Model averaging; Phishing detection; Privacy-preserving; Spam detection; TF/IDF
Reviewed
Funder: Oceania Cyber Security

Hits: 796
Visitors: 679
Downloads: 0

		Thumbnail	File	Description	Size	Format